Curriculum Learning for Speech Emotion Recognition From Crowdsourced Labels
نویسندگان
چکیده
منابع مشابه
Learning Attributes from the Crowdsourced Relative Labels
Finding semantic attributes to describe related concepts is typically a hard problem. The commonly used attributes in most fields are designed by domain experts, which is expensive and time-consuming. In this paper we propose an efficient method to learn human comprehensible attributes with crowdsourcing. We first design an analogical interface to collect relative labels from the crowds. Then w...
متن کاملActive learning for dimensional speech emotion recognition
State-of-the-art dimensional speech emotion recognition systems are trained using continuously labelled instances. The data labelling process is labour intensive and time-consuming. In this paper, we propose to apply active learning to reduce according efforts: The unlabelled instances are evaluated automatically, and only the most informative ones are intelligently picked by an informativeness...
متن کاملFeature Transfer Learning for Speech Emotion Recognition
Speech Emotion Recognition (SER) has achieved some substantial progress in the past few decades since the dawn of emotion and speech research. In many aspects, various research efforts have been made in an attempt to achieve human-like emotion recognition performance in real-life settings. However, with the availability of speech data obtained from different devices and varied acquisition condi...
متن کاملRepresentation Learning for Speech Emotion Recognition
Speech emotion recognition is an important problem with applications as varied as human-computer interfaces and affective computing. Previous approaches to emotion recognition have mostly focused on extraction of carefully engineered features and have trained simple classifiers for the emotion task. There has been limited effort at representation learning for affect recognition, where features ...
متن کاملEmotion Recognition from Speech
This paper proposes the classification of emotions based on spectral features using the Gaussian Mixture Model as the classifier. The performance of the Gaussian Mixture Model has been evaluated for two types of databases – acted and reallife speech corpuses. The model has also been evaluated for the variation in its performance based on the speaker, gender of the speaker and the number of the ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE/ACM Transactions on Audio, Speech, and Language Processing
سال: 2019
ISSN: 2329-9290,2329-9304
DOI: 10.1109/taslp.2019.2898816